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Abstract —The recognition of color texture under varying lighting con¬ 
ditions is still an open issue. Several features have been proposed for 
this purpose, ranging from traditional statistical descriptors to features 
extracted with neural networks. Still, it is not completely clear under what 
circumstances a feature performs better than the others. In this paper we 
report an extensive comparison of old and new texture features, with and 
without a color normalization step, with a particular focus on how they 
are affected by small and large variation in the lighting conditions. The 
evaluation is performed on a new texture database including 68 sam¬ 
ples of raw food acquired under 46 conditions that present single and 
combined variations of light color, direction and intensity. The database 
allows to systematically investigate the robustness of texture descriptors 
across a large range of variations of imaging conditions. 


1 Introduction 

The role of color in texture classification has been widely 
debated in the literature. Despite the number and the depth 
of the experimental verifications, it is still not completely 
clear how much and under what circumstances color in¬ 
formation is beneficial. Notable examples of this kind of 
analysis are the work by Maenpaa and Pietikainen |l), and 
that by Bianconi et al. 0. They both observed how color 
can be effective, but only in those cases where illumination 
conditions do not vary too much between training and test 
sets. In fact, methods that exploit color information greatly 
suffer variations in the color of the illuminant. Under these 
circumstances the best result is often achieved simply by 
disregarding color, that is, by reducing all the images to 
gray scale. The degree of intra-class variability of the images 
in Fig. [l] suggests why color information, if not properly 
processed, can easily be deceptive. 

A possible strategy to exploit color in texture classi¬ 
fication consists in the extraction of image features that 
are invariant (or at least robust) with respect to changes 
in the illumination. In scene and object recognition the 
approach of specially designing invariant features is rapidly 
becoming obsolete in favor of features automatically learned 
from a large amount of data with methods based on deep 
learning 0. It is not clear if the same is going to happen 
in texture recognition as well. A recent work j4) suggests 
that a hybrid approach (local features extracted from a 
convolutional neural network and then aggregated as Fisher 
vectors) can be the most successful. 


The availability of suitable databases of texture images 
is of primary importance for the research in this field. 
Therefore, in the past several texture databases have been 
collected to asses the performance of texture recognition 
methods under a variety of conditions. These databases 
are often focused on the exploration of the variability of 
texture images under specific variations of imaging condi¬ 
tions m, i6i, mostly related to variations in the geometry 
of the acquisition setup (with little or no variation about 
the characteristics of the illuminant). For instance, several 
texture databases include images where the same samples 
are taken from different point of views. As a result, the 
images depict the same textures taken at different scales 
and orientations. By contrast, a more recent work proposed 
a database of "textures in the wild" 0 to allow texture 
analysis in completely uncontrolled environments. This ap¬ 
proach allows to implicitly verify the robustness against 
a multitude of source of variations simultaneously. Even 
though, the results on this kind of dataset may provide a 
better indication about the 'average' performance of texture 
recognition methods in several real-world applications, they 
do not allow a clear analysis of their strengths and weak¬ 
nesses for specific setups. In fact, there are several applica¬ 
tion domains where acquisition conditions are indeed very 
controlled (medical imaging, industrial inspection...) and 
for which the uncertainty inherent to the experimentation 
in the wild is a serious liability. 

In this paper we address the problem of texture classi¬ 
fication under controlled, large variations of lighting con¬ 
ditions. We have evaluated and compared several texture 
and color descriptors with respect to single and combined 
changes in the lighting conditions. We selected three classes 
of visual descriptors. The first class includes traditional (hand 
crafted) descriptors specially designed for texture analysis. 
The second one includes features that were specially de¬ 
signed for object recognition. The third class includes those 
features that correspond to intermediate representations 
computed by Convolutional Neural Networks (CNNs). 

Since we addressed the problem of texture classification 
under varying lighting conditions, we also investigated the 
use of color normalization methods as a preprocessing, 
so quantifying how much their application influences the 
performance of the different descriptors. 

Existing texture databases 0, 0 do not include, in 




2 


D65 (0=24°) D95 (0=24°) L27(6>=24°) D65(6>=60°) D95(6>=60°) L27(6>=60°) D65(6>=90°) D95(0=9O°) L27(0=9O°) 



(b) 

Fig. 1. Examples of 2 different textures acquired under 9 different light conditions, (a) Salt, (b) Grapefruit. 


general, large variations of lighting conditions and, in par¬ 
ticular, they do not allow to evaluate the goodness of visual 
descriptors with respect to both single and combined light¬ 
ing condition changes, such as only direction or temperature 
of the light, direction and temperature of the light, etc. Due 
to these reasons, we collected a new texture database, that 
we named Raw Food Texture database (RawFooT). This 
database includes several samples of raw food acquired 
under 46 conditions differing in the light color, direction and 
intensity We choose to focus on raw food because, similarly 
to other natural surfaces, its color is an intrinsic property 
Therefore, the task of classifying this kind of textures does 
not include the semantic ambiguities that, instead, may arise 
when dealing with artificial surfaces, where it is possible 
to argue that samples of different color may be instances 
of the same class of "Textures". As far as we know the 
proposed database is the one featuring the largest amount 
of variations in the lighting conditions, and the only one 
where color, direction and intensity of light are subject to 
systematic and independent changes. 

In this paper we address the following issues: 

• How effective are hand-crafted texture descriptors when 
acquisition condition variations are so large? 

• Can object recognition descriptors achieve high classifi¬ 
cation accuracy on pure texture images? 

• Do CNN-based descriptors confirm to be powerful also 
on texture classification tasks? 

• Can CNN-based descriptors handle large variations in 
lighting conditions? 

• Is color normalization helpful for texture analysis in 
case of changes in lighting conditions? 

The rest of the paper is organized as follows: Section [2] 
describes the proposed RawFooT database and compares it 
against other publicly available data sets; Section [3] reviews 
the main texture descriptors in the state of art; Section [4] 
describes the experimental setup and Section [5] reports the 
results obtained; finally. Section [6] presents our final consid¬ 
erations and discusses some new directions for our future 
research on this topic. 

2 Texture databases 

In the last years, different research groups developed a 
number of databases of texture images ranging from natural 
textures to man-made materials m, m, m Each database 


has been designed to study one or several aspects about tex¬ 
tures: invariance to acquisition device, invariance to lighting 
conditions, invariance to image rotation or scale, 3D recon¬ 
struction, computer graphics, classification, segmentation, 
etc. The problems of texture classification and of material 
recognition are closely related. In this paper we mainly 
focused on the former since the two problems may require 
different strategies (9)* 

We considered the most important texture databases 
that have been presented in the literature 0, (6J, and we 
compiled the Table [l] where we have highlighted the most 
important features of each database. Amongst the features, 
the most important are those related to the acquisition setup, 
such as: illumination conditions, sensor's angle, image rotation, 
scaling and color of the illuminant. In particular, we high¬ 
lighted four different sources of variations in illumination 
conditions. One related to the direction of the light, one to 
the intensity of the light, another to the color temperature of 
the light, and the fourth related to a mixture of variations, 
such as: temperature and direction, temperature and inten¬ 
sity, etc. As you can see from the table, several databases 
consider a mixture of variations. The most notable is the 
OuTex database which is, in fact, the most used in 
the study of descriptors invariant to lighting conditions fll . 
The OuTex collection includes the OuTex-14 test suite that 
contains images that depict textures of 68 different classes 
acquired under three different light sources, each positioned 
differently: the 2856 K incandescent CIE A, the 2300 K hori¬ 
zon sunlight and the 4000 K fluorescent TL84. 

Few databases consider variations of light direction, 
intensity or temperature separately. In particular, the only 
database that provides a good number of this kind of vari¬ 
ations is the ALOT database ED This collection provides 
250 classes of textures acquired under several conditions 
obtained combining five illumination directions (at 3075 K) 
and one semi-hemispherical illumination (at 2175 K). Each 
object was recorded with only one out of five lights turned 
on, yielding five different illumination angles. One image 
is recorded with all lights turned on, yielding a sort of 
hemispherical illumination. All the images are acquired by 
four cameras positioned differently. 

As far as we know, no publicly available texture database 
has been designed to assess the performance in texture 
classification under a broad range of variations in the il¬ 
lumination color, direction and intensity. This is why we 
collected the Raw Food Texture database. 
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TABLE 1 

Main features of existing texture databases 03], ft 2]. The filled circle indicates that the given feature is present, the empty circle indicates its 
absence, the minus sign indicates that information on that feature is not available. 


Database 

Type of 
surface 

#Classes 

#Images 

Image 

size 

Color 

repr. 

Rotation 

Scaling 

Light di¬ 
rections 

Light in¬ 
tensity 

Light tem¬ 
perature 

Lighting vari¬ 
ability 

Sensor's 

angle 

Camera sensor 

Brodatz |13l 

natural, 

artificial 

112 classes 
112 images 

512x512 

gray-scale 

• 

0 

0 

0 

0 

• controlled 
unknown 

scanned 

- 

CUReT El 

natural, 

artificial 

61 classes 

over 

14 000 
images 

512x512 

color RGB 

• 

0 

• 55 

0 

0 

0 

7 

positions 

3 separate 

CCD 

VisTex El 

natural, 

artificial 

54 classes 
864 images 

512x512 


0 

0 

• 

unknown 

directions 

0 

0 

• uncon¬ 

trolled: 
daylight, 
fluorescent, 
incandescent 

frontal, 

oblique 


MeasTex |16| 

natural, 

artificial 

4 classes 

944 images 

512x512 

gray-scale 

• 

0 

0 

0 

0 

• daylight di¬ 
rect and indi¬ 
rect, flash 

frontal, 

oblique 

scanned from 
35mm film 

PhoTex El 

rough 

surfaces 

64 classes 

1280 X1024 

gray-scale 

• 

0 

• 4 

0 

0 

0 

12 

positions 

Vosskuhler 

CCD 1300LN 

OuTex El 

natural, 

artificial 

320 classes 

51 840 im¬ 
ages 

516x716 

color RGB 

• 

• 

0 

0 

0 

• fluorescent, 
tl84, 

incandescent 
at 3 positions 

frontal 

3 CCD, Sony 
DXC-755P 

DynTex |18l 

natural, 

artificial 

36 classes 
345 

sequences 

720 X 576 
25fps 

color RGB 

• 

0 

0 

0 

0 

0 


3 CCD, Sony 
DCR-TRV890E 
TRV900E 

uiuc El 

natural, 

artificial 

25 classes 
1000 images 

640x480 

gray-scale 

0 

• 

0 

0 

0 

0 

various 

- 

KTH- 
TIPS2 [20] 

natural, 

food, 

artificial 

44 classes 
4608 images 

1280 X 960 

color RGB 

• 

• 

• 3 

0 

0 

• incan¬ 

descent, 
fluorescent 

frontal, 

oblique 

Olympus 

C-3030ZOOM 

ALOT rm 

natural, 

food, 

artificial 

250 classes, 
over 

27 500 


color RGB 

• 

0 

• 5 

0 

• 2 

0 

4 

positions 

Foveon X3 

3CMOS 

Mondial 

Marmi[22l 

granite 

12 classes 

544x544 

color RGB 

• 

0 

0 

0 

0 

• white HB 
LEDs 

frontal 

Samsung S850 

STex f23l 

natural, 

artificial 

476 classes 

1024 X1024 

color RGB 

0 

0 

0 

0 

0 

0 

0 

- 

USPTex [24l 

natural, 

artificial 

191 classes, 
2292 images 

512 X 384 

color RGB 

0 

0 

0 

0 

0 

0 

- 

- 

Proposed 

RawFooT 

natural, 

food 

68 classes, 
3128 images 

800 X 800 

color RGB 

0 

0 

• 9 

• 5 

• 18 

• 12 

frontal 

Canon 40D 


2.1 The Raw Food Texture database (RawFooT) 

The Raw Food Texture database (RawFooT) has been spe¬ 
cially designed to investigate the robustness of descriptors 
and classification methods with respect to variations in the 
lighting conditions, with a particular focus on variations in 
the color of the illuminant. The database includes images of 
samples of textures, acquired under 46 lighting conditions 
which may differ in the light direction, in the illuminant 
color, in its intensity, or in a combination of these factors. 

Psycho-physical studies (25J suggest that, in the human 
visual system, color and pattern information are processed 
separately However, it has been observed that their com¬ 
bination can be very effective for texture classification. For 
certain classes of materials the two kind of information are 
clearly independent (e.g. fabrics and other artificial materi¬ 
als). For this reason, we considered samples of texture where 
the relationship between pattern information and color has 
not been explicitly designed. Our classes correspond to 68 
samples of raw food, including various kind of meat, fish, 
cereals, fruit etc. Therefore, the whole database includes 
68 x 46 = 3128 images. Fig. [2] shows an image of each 
sample. 

Pictures have been acquired in a dark room with a 
Canon EOS 40D DSLR camera. The camera was placed 
48cm above the sample to be acquired, with the optical axis 
perpendicular to the surface of the sample. The lenses used 
had a focal length of 85mm, and a camera aperture of f/11.3; 
each picture has been taken with four seconds of exposition 


camera 



Fig. 3. The setup used to acquire the Raw Food Texture database. 


time. As illuminants, we used a pair of monitors (22 inches 
Samsung SyncMaster LED monitor) positioned above the 
sample and tilted by 45 degrees, with about 20cm of space 
between their upper edges to make room for the camera. By 
illuminating different regions of the monitors, and by using 
different colors (inspired by f26j) we simulated natural and 
artificial illuminants coming from different directions and 
at various intensity levels. The two monitors have been 
colorimetrically characterized using a X-Rite il spectral col¬ 
orimeter, in such a way that the device RGB coordinates 
can be used to accurately render the desired chromaticites. 
Daylight at 6500 K (D65) has been specified as a white point. 
Fig. [3] shows the setup used for the acquisitions. 

For each sample a program of 46 shots has been fol- 
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(1) chickpeas (2) corn (3) salt (4) cookie (5) lentils (6) candies (7) green peas (8) puffed rice (9) spelt (10) white peas (11) cous cous (12) sliced bread 



(13) apple slice (14) pearl barley (15) oat (16) black rice (17) quinoa (18) buckwheat (19) puffed rice (20) basmati rice (21) steak (22) fennel seeds (23) poppy seeds (24) brown sugar 



(25) sultana (26) coffee powder(27) polenta flour (28) salami (29) air-cured beef (30) flatbread (31) corn crackers (32) oregano (33) black beans (34) soluble coffee (35) hazelnut grain(36) flour 



(49) garlic grain 

■■■I 


(50) curry (51) pink pepper (52) kiwi (53) mango (54) pomegranate (55) currant (56) pumpkin seed(57) tea 



(58) red lentils (59) green adzuki (60) linseeds 


(61) coconut flakes (62) chicory (63) pork loin (64) chicken breast (65) carrots (66) sugar (67) salmon (68) tuna 


Fig. 2. Overview of the 68 classes included in the Raw Food Texture database. For each class it is shown the image taken under D65 at direction 

< 9 = 24 °. 


lowed: 

intensity variations: four shots have been taken while il¬ 
luminating the whole monitors with neutral light (D65) at 
different levels of intensity (100%, 75%, 50% and 25% of the 
maximum achievable level); 

light direction: nine shots have been taken with the light 
(D65) coming from different angles. In the first eight of these 
shots only a band covering 40% of a single monitor has been 
lit. The angle between the direction of the light coming from 
the center of the illuminated band and the surface of the 
sample are 24, 30, 36, 42, 48, 54, 60, and 66 degrees. For the 
last shot two bands covering the upper 20% of each monitor 
have been lit (since on average the light comes exactly from 
above the sample, we count it as an angle of 90 degrees). 
Daylight: 12 shots have been taken while simulating natural 
daylight at different color temperatures. To do so, given a 
color temperature T we applied the following equations to 
obtain the corresponding xy chromaticities: 

10 3 10 6 10 9 

x — a 0 + ai + a 2 + a 3 ^ 

y = — 3x 2 + 2.87x — 0.275, 

where a 0 = 0.244063, a\ — 0.09911, a 2 = 2.9678, 
a 3 = -4.6070 if 4000 K <T< 7000 K, and a 0 = 0.23704, 
ai = 0.24748, a 2 = 1.9018, a 3 = -2.0064 if 7000 K < T < 
25 000 K l27l . Chromaticities have been converted in the 
RGB space with a scaling of the color channels ensuring 
that their largest value is 255. We considered 12 color tem¬ 
peratures in the range from 4000 K to 9500 K with a step 
of 500 K (we will refer to these as D40, D45, ..., D95). The 
whole monitors have been lit during these shots. 

Indoor illumination: six shots have been taken while simu¬ 
lating an artificial light with a color temperature of 2700 K, 


3000 K, 4000 K, 5000 K, 5700 K and 6500 K on the two whole 
monitors. We considered LED lights produced by OSRAM 
and we computed the corresponding RGB values starting 
from the chromaticities indicated in the data sheets from the 
producer's web sit^J We will refer to these as L27, L30,..., 
L65. 

Color and direction: nine shots have been taken by varying 
both the color and the direction of the illuminant. The 
combinations of three colors (D65, D95 and L27) and of three 
directions (24, 60 and 90 degrees) have been considered. 
Multiple illuminants: three shots have been taken while 
the sample is illuminated by two illuminants with different 
colors (D65, D95 or L27). Bands covering the lower 40% of 
both the monitors have been lit, using two different colors 
on the two monitors. 

Primary colors: three shots have been taken under pure red, 
green and blue illuminants. 

Each 3944 x 2622 picture in the camera space has been 
converted to standard sRGB and the final texture images 
have been obtained by cropping the central region of 
800 x 800 pixels. Fig. [3] shows the 46 shots taken for two 
of the 68 samples. To allow the estimate of the illuminants 
we have carried out the program of 46 shots of a 24 squares 
Macbeth ColorChecker. 

3 Texture Descriptors 

A huge variety of texture descriptors have been proposed in 
the literature. These were traditionally divided into statis¬ 
tical, spectral, structural and hybrid approach (28). Among 
traditional methods the most popular are probably those 

1. http://www.osram-os.com 
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D65 (1=100%) D65 (1=75%) D65 (1=50%) D65 (1=25%) D65 (0=24°) D65 (0=30°) D65 (6»=36°) D65 (6»=42°) D65 (0=48°) D65 (0=54°) D65 (0=60°) D65 (0=66°) 



D65(0=9O°) D40 D45 D50 D55 D60 D65 D70 D75 D80 D85 D90 



D95 L27 L30 L40 L50 L57 L65 D65(0=24°) D95(0=24°) L27(0=24°) D65(0=6O°) D95(0=6O°) 



L27 (0=60°) D65 (0=90°) D95 (0=90°) L27(0=9O°) D65-D95 D65-L27 D95-L27 Red Green Blue 

Fig. 4. Overview of the 46 lighting conditions in the Raw Food Texture database: the top rows represent the flour class while bottom rows represent 
the currant class. 


based on histograms, Gabor filters |29j, cooccurrence matri¬ 
ces (301, and Local Binary Patterns [31] • These descriptors 
display different strengths and weaknesses in particular 
concerning their invariance with respect to the acquisition 
conditions. 

Traditional descriptors are often designed to capture 
texture information in uncluttered images taken under con¬ 
trolled conditions. To address those cases where the condi¬ 
tions cannot be controlled, a few attempts have been made 
to adapt features used for scene or object recognition to 
the domain of texture classification. For instance, Sharan et 
al. [9] used SIFT and HOG descriptors for material classi¬ 
fication, while Sharma et al. (32] used a variation of the 
Fisher Vector approach for texture and face classification. 
Cimpoi et al. (3 shown how SIFT descriptors aggregated 
with the improved Fisher vector method greatly outperform 
previous descriptors in the state of the art on a variety of 
texture classification tasks, including the classification of 
'Textures in the wild." 

Following the trend in image recognition, features ex¬ 
tracted from Convolutional Neural Networks (CNNs) have 
been adopted for texture classification as well. CNNs allow 
to leverage very large datasets of labeled images, by learn¬ 
ing intermediate image representations that can be used 
for various image classification problems [33l . For instance. 


Cimpoi et al. (U used Fisher Vectors to pool features com¬ 
puted by a CNN trained for object recognition. 

In addition to these general purpose texture descriptors, 
a variety of descriptors have been specially designed to be 
robust with respect to specific variations in the acquisition 
conditions. Khan et ol. ED, for instance, considered a di¬ 
agonal/offset model for illumination variations, deduced 
from it an image normalization transformation, and finally 
extracted Gabor features from the normalized images. Other 
color normalization techniques can be used for this pur¬ 
pose. Finlayson et al. proposed rank-based features obtained 
from invariant color representations [35]. Seifi et al., in¬ 
stead, proposed to characterize color textures by analyzing 
the rank correlation between pixels located in the same 
neighborhood. They obtained a correlation measure which 
is related to the colors of the pixels, and is not sensitive 
to illumination changes [36]. Cusano et al. [37] proposed a 
texture descriptor specially designed to deal with the case 
of variations in the color of the illuminant. The reader can 
refer to the work of Drbohlav et al. |38| for a comparative 
analysis of texture methods under varying viewpoint and 
illumination, and to the work of Kandaswamy et al. |39| for 
a comparison among texture analysis schemes under non¬ 
ideal conditions. 

In this work we compared several descriptors from the 
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state of the art, by taking a few representative descriptors for 
each of the approaches mentioned above. Several descrip¬ 
tors have been applied to both color and gray-scale images, 
where the gray-scale image is defined as the luminance of 
the image and is obtained by using the standard formula: 
L = 0.299# + 0.587G + 0.114#. 

In order to make the results readable we consider, here, 
only a selection of all the descriptors evaluated. 

3.1 Traditional descriptors 

• 256-dimensional gray-scale histogram; 

• 512-dimensional Hue and Value marginal histogram 
obtained from the HSV color representation of the 
image; 

• 768-dimensional RGB and rgb marginal his¬ 
tograms l40l ; 

• 10-dimensional feature vector composed of normal¬ 
ized chromaticity moments as defined in |4H : 

• 15-dimensional feature vector composed of contrast, 
correlation, energy, entropy and homogeneity ex¬ 
tracted from the co-occurrence matrices of each color 
channel El , |43|; 

• 144-dimensional Gabor features composed of mean 
and standard deviation of six orientations extracted 
at four frequencies for each color channel (44l , [29J; 

• 264-dimensional opponent Gabor feature vector ex¬ 
tracted as Gabor features from several inter/intra 
channel combinations: monochrome features ex¬ 
tracted from each channel separately and opponent 
features extracted from couple of colors at different 
frequencies [45j; 

• 54-dimensional Dual Tree Complex Wavelet Transform 
(DT-CWT) features obtained considering four scales, 
mean and standard deviation, and three color chan¬ 
nels [44], |46|; 

• 26-dimensional feature vector obtained calculating 
morphological operators {granulometries ) at four an¬ 
gles and for each color channel |47]; 

• 512-dimensional Gist features obtained considering 
eight orientations and four scales for each chan¬ 
nel l48l ; 

• 81-dimensional Histogram of Oriented Gradients 
(HOG) feature vector l49l . Nine histograms with nine 
bins are concatenated to achieve the final feature 
vector; 

• 243-dimensional Local Binary Patterns (LBP) feature 
vector computed with 16 neighbors, radius two and 
uniform patterns. We applied LBP to gray-scale im¬ 
ages and then also to the color channels RGB, CIE- 
Lab and Ohta's I 1 I 2 I 3 spaces (in these cases the vector 
will be 729-dimensional) |T); 

• Combination of LBP computed on pairs of 
color channels, namely the Opponent Color LBP 
(OCLBP) (SOI; 

• LBP combined with the Local Color Contrast descrip¬ 
tor, as described in J37); 

• 499-dimensional Local Color Contrast feature vec¬ 
tor. It is obtained by concatenating the LBP on the 
gray images with a quantized measure of color con¬ 
trast l37l ; 


3.2 Descriptors for object recognition 

The features considered here consists in the aggregation of 
local descriptors according to the quantization defined by 
a codebook of visual words. As local descriptors we used 
128-dimensional dense SIFT obtained from the gray-scale 
image by considering a spatial histogram of local gradient 
orientations. The spatial bins have an extent of 6 x 6. The 
descriptors have been sampled every two pixel and at scales 
2*/ 3 , £ = 0,1,2,- 

The object recognition features differ for the aggrega¬ 
tion method, but all of them are based on a codebook of 
1024 visual words built on images from external sources. 
In particular we downloaded 20 000 images from Flickr 
containing various content, such as sunset, countryside, etc. 
and we used k -means to find 1024 representative vectors. 

The object recognition features considered here are: 

• 1024-dimensional bag of visual words (BoVW). 

• 25 600-dimensional vector of locally aggregated de¬ 
scriptors (vlad) (7). 

• 40 960-dimensional Fisher's vectors (fv) of locally 
aggregated descriptors l5H . 

3.3 CNN-based descriptors 

The CNN-based features have been obtained as the interme¬ 
diate representations of deep convolutional neural networks 
originally trained for object recognition. The networks are 
used to generate a texture descriptor by removing the final 
softmax nonlinearity and the last fully-connected layer, re¬ 
sulting in feature vectors which are L 2 normalized before 
being used for classification. We considered the most rep¬ 
resentative CNN architectures in the state of the art 1521 , 
each exploring a different accuracy/speed trade-off. All the 
CNNs have been trained on the ILSVRC-2012 dataset using 
the same protocol as in (53). In particular we considered 
4096, 2048, 1024 and 128-dimensional feature vectors as 
follows lf33l : 

• BVLC AlexNet (BVLC AlexNet): AlexNet trained on 
ILSVRC 2012 (53). 

• BVLC Reference CaffeNet (BVLC Ref): AlexNet trained 
on ILSVRC 2012, with a minor variation from the 
version as described in (53). 

• Fast CNN (Vgg F): it is similar to the one presented 
in m with a reduced number of convolutional 
layers and the dense connectivity between convolu¬ 
tional layers. The last fully-connected layer is 4096- 
dimensional ED. 

• Medium CNN (Vgg M): it is similar to the one pre¬ 
sented in tan with a reduced number of filters in 
the convolutional layer four. The last fully-connected 
layer is 4096-dimensional (54l . 

• Medium CNN (Vgg M-2048-1024-128): three modifica¬ 
tions of the Vgg M network, with lower dimensional 
last fully-connected layer. In particular we used a 
feature vector of 2048,1024 and 128 size [54]. 

• Slow CNN (Vgg S): it is similar to the one presented 
in m with a reduced number of convolutional 
layers, less filters in the layer five and the Local 
Response Normalization. The last fully-connected 
layer is 4096-dimensional (54l . 
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• Vgg Very Deep 19 and 16 layers (Vgg VeryDeep 16 
and 19): the configuration of these networks has been 
achieved by increasing the depth to 16 and 19 layers, 
that results in a substantially deeper network than 
what has been used in previous studies l57l . 

3.4 Color normalization 

Invariance with respect to specific changes in acquisition 
conditions, such as those caused by variations in the illu¬ 
mination, is an important property of visual descriptors. 
Illumination variations can be also compensated by prepro¬ 
cessing images with a color normalization method. Color 
normalization methods try to assign a constant color to 
objects acquired under different illumination conditions. 

In order to evaluate this strategy, we have preprocessed 
the RawFooT database by using several existing normal¬ 
ization methods and next we have extracted features by 
using the best color descriptors from the set of descriptors 
evaluated in table [2] More precisely, we considered two im¬ 
plementations of the Retinex method described in lf58l that 
improve the computational efficiency while preserving the 
underlying principles: the McCann99 |59| and the Frankle- 
McCann m. Furthermore, we considered the Gray World 
|61j, two variants of edge based algorithm, the Gray-Edge 
[62][ and the weighted Gray-Edge method |63j. 

4 Experiments 

In all the experiments we used the nearest neighbor classi¬ 
fication strategy: given a patch in the test set, its distance 
with respect to all the training patches is computed. The 
prediction of the classifier is the class of the closest element 
in the training set. For this purpose, after some preliminary 
tests with several descriptors in which we evaluated the 
most common distance measures, we decided to use the 
LI distance: d(x, y) = l x i — yi|, where x and y 

are two feature vectors. All the experiments have been 
conducted under the maximum ignorance assumption, that 
is, no information about the lighting conditions of the test 
patches is available for the classification method and for the 
descriptors. Performance is reported as classification rate 
(i.e., the ratio between the number of correctly classified 
images and the number of test images). Note that more 
complex classification schemes (e.g. SVMs) would have 
been viable. We decided to adopt the simplest one in order 
to focus the evaluation on the features themselves and not 
on the classifier. 

4.1 RawFooT database setup 

For each of the 68 classes we considered 16 patches ob¬ 
tained by dividing the original texture image, that is of 
size 800 x 800 pixels, in 16 non-overlapping squares of size 
200 x 200 pixels. For each class we selected eight patches 
for training and eight for testing alternating them in a 
chessboard pattern. We form subsets of 68 x (8 + 8) = 1088 
patches by taking the training and test patches from images 
taken under different lighting conditions. 

In this way we defined several subsets, grouped in nine 
texture classification tasks. 


1) No variations: 46 subsets. Each subset is composed 
of training and test patches taken under the same 
lighting condition. 

2) Light intensity: 12 subsets obtained by combining 
the four intensity variations. Each subset is com¬ 
posed of training and test patches with different 
light intensity values. 

3) Light direction: 72 subsets obtained by combining 
the nine different light directions. Each subset is 
composed of training and test patches with different 
light direction. 

4) Daylight temperature: 132 subsets obtained by 
combining all the 12 daylight temperature varia¬ 
tions. Each subset is composed of training and test 
patches with different light temperatures. 

5) LED temperature: 30 subsets obtained by combin¬ 
ing all the six LED temperature variations. Each 
subset is composed of training and test patches with 
different light temperatures. 

6) Daylight vs. LED: 72 subsets obtained by combin¬ 
ing 12 daylight temperatures with six LED temper¬ 
atures. 

7) Temperature or direction: 72 subsets obtained by 
combining all the nine combinations of color tem¬ 
peratures and light directions. Each subset is com¬ 
posed of training and test patches where either the 
color or the direction (or both) change. 

8) Temperature and direction: 36 subsets obtained 
by combining all the nine combinations of color 
temperatures and light directions. Each subset is 
composed of training and test patches where both 
the color and the direction change. 

9) Multiple illuminant: six subsets obtained by com¬ 
bining the three acquisitions with multiple illumi¬ 
nant s. 

5 Results 

Table [2] reports the performance obtained by the descriptors 
considered as average and minimum accuracy over the nine 
classification tasks. For the four main tasks ( same illuminant, 
light intensity, light direction and daylight temperature ) the 
results are shown in greater detail in Figure |5j but only for 
some representative methods. When training and test are 
taken under the same lighting conditions the classification 
rates are generally high, regardless the specific conditions. 
CNN features perform very well, with a peak of 98.2% of 
accuracy obtained with the features extracted by the Vgg 
Very Deep 16 network. Other, more traditional features per¬ 
form very well in this scenario (OCLBP at 95.9% Opp. Gabor 
at 96.2%...) and even simple rgh histograms achieve an 
accuracy of 97.2%. It is clear that under fixed conditions, 
texture classification is not a very challenging problem, see 
also Figure |5|a). 

When training and test patches are taken under variable 
intensity, the behavior of CNN features and of the descrip¬ 
tors taken from the object recognition literature (BoVW) is 
very stable. Surprisingly, traditional hand-crafted features are 
heavily affected by this kind of variations even when they 
are supposed to be robust to them, as should be the case 
of LBP and Gabor-based features. This behavior is more 




Fig. 5. Detail of the classification rates as functions of the amount of variability in the illumination conditions between the training and test set. (a) 
Accuracy obtained in the no variations classification task (each point corresponds to one of the 46 shots), (b) Accuracy with respect to the difference 
A I of light intensity, (c) Accuracy obtained varying the difference between the direction of the light, (d) Accuracy with respect to the difference AT 
of Daylight temperature. 


evident looking at Figure (^b), where only Vgg VeryDeep 16 
and BoVW have flat curves over changes in the intensity of 
the light. 

Perhaps one of the most challenging variation to take 
into account is that related to the direction of the light, 
see Figure j5|c). In this task all the descriptors suffered a 
noticeable decrease in performance. However, some CNN 
features remained, on average, above 90% of accuracy. The 
performance of all the other features dropped below 70%. 

When the illuminant color is allowed to vary between 
train and test images, the achromatic features are the least 
affected. In particular, the features from the object recogni¬ 
tion literature obtained the same performance of the same 
illumination task. For other features, such as LBP-L, we 
observed a decrease in the performance, probably due to 
the variation in intensity caused by the change of the color 
temperature. Features that use color information greatly 
suffer this kind of variability (see Figure 5jd)). The most 
important exception is represented by the CNN features that 
have been trained to exploit color, but in such a way to be 
robust with respect to the large amount of variability in the 
object categories they were supposed to discriminate. 

Very low performance have been obtained when both 
direction and color change simultaneously. In this case the 
best results have been obtained by, again, features from 
CNNs. However, the highest classification rate is quite low 
(about 63.6%) and most networks do not allow to achieve 
more than 50%. The results for the other features are even 
worse than that. 

The last task involved the presence of multiple illumi- 
nants. Since their position was stable, we obtained similar 
results of those of the case of variable color temperature. 


Summing up, the challenges of recognizing textures 
under variable illumination conditions greatly depends on 
the type of variability involved in the experiments. Features 
extracted by CNNs significantly outperform the other de¬ 
scriptors considered. Features from the object recognition 
literature clearly outperform traditional hand-crafted texture 
features in all the scenarios considered. Only under some 
specific circumstances these last ones outperformed CNN 
features. For instance, in Figure [5] (d) it can be observed that 
CNN features fall below the bag of visual words descriptor 
for extreme variations in the color temperature. This circum¬ 
stances can be better understood by looking at Fig.[6ja). This 
figure compares the behavior of Vgg Very Deep 16 and BoVW 
over the 68 classes. Here the training and test images have 
been taken under different lights but all directed with the 
same angle. In particular we averaged the accuracy obtained 
in three sets of experiments, one for each angle (24,60,90 
deg) all including three lights: D65, D95 and L27. It is quite 
evident that in this case object recognition features outper¬ 
form CNNs especially for those classes whose appearance 
is most sensitive to color changes and that contain a more 
fine-grained texture, such as (3) salt, (12) sliced bread, (36) 
flour, (53) mango, (61) coconut flakes, (66) sugar, etc. This result 
is due to the fact the CNNs have mainly been trained on 
images of objects and thus containing more coarse details. In 
contrast, when the training and test images have been taken 
under the same light but with different light directions, 
CNN features demonstrate to be more robust than object 
recognition features, see Fig. |6jb). Here the worst results are 
obtained by BoVW on coarse-grained texture images, such 
as (1) chickpeas, (20) basmati rice, (62) chicory, etc. 
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Fig. 6. Accuracy of the Vgg VeryDeep 16 (turquoise) and BoVW (black) over the 68 classes, (a) training and test images are under different lights 
and same angle, (b) training and test images are under the same light and different angles. To map the numbers to the corresponding classes see 
Figure[2] 


5.1 Preprocessing with color normalization 

We have preprocessed all the images with five state-of-the- 
art color normalization methods. Examples of preprocessing 
applied to two different samples are represented in Figure [7] 

Table [3] reports the performance obtained by these color 
normalization methods combined with a selection of de¬ 
scriptors. It is clear that color normalization helps to im¬ 
prove performance in the case of CNNs. In particular, 
the combination of Vgg VeryDeep 16 with Retinex Frankie 
achieves an improvement of 5% in both the cases of temper¬ 
ature and/or direction variations. This result confirms the 
fact that CNNs have been trained on images without con¬ 
sidering changes in illumination conditions. In contrast, the 
combination of hand-crafted features with pre-processing 
methods, in most of the cases, does not bring any improve¬ 
ments in terms of classification rate. This is due to the fact 
that those features, except for color histogram, have been 
designed to be more robust to changes in the temperature 
of the light. 

6 Summary 

In order to obtain reliable classification of color textures 
under uncontrolled conditions, we believe that the descrip¬ 
tors performance should be assessed under a large set of 
carefully controlled variations of lighting conditions. We 
described RawFooT, a database of texture images acquired 
under variable light direction, color, and intensity. The im¬ 
ages of the database will be made publicly available together 
with all the scripts used in our experimentation. We will also 
disclose the detailed technical specifications of the hardware 
and software used to acquire the database; this will allow 
the researchers in this area to extend RawFooT or to acquire 
their own database. 

RawFooT allowed us to conduct a variety of experiments 
in which traditional texture, object recognition and CNN- 
based descriptors have been evaluated in terms of their 


capabilities in dealing with single and combined variations 
in the lighting conditions. These experiments made very 
clear the strengths and the weaknesses of the investigated 
approach and clearly outlined open issues that should be 
addressed to actually design color texture descriptors robust 
with respect to unknown variations in the imaging condi¬ 
tions. 

In extreme summary, we can conclude that: 

• Traditional texture descriptors are effective only when 
images have no-variations in lighting conditions. 

• Object recognition descriptors demonstrated to per¬ 
form, in most of the cases, better than the traditional 
ones. 

• CNN-based descriptors confirmed to be powerful also 
on texture classification tasks outperforming the 
hand-crafted traditional and object-oriented features. 

• CNN-based descriptors handle most of the variations 
in lighting conditions. However for large variations 
in both color and direction of the light, CNN-based 
descriptors have demonstrated to be less effective 
than object recognition descriptors, especially on 
these classes that are more fine grained. 

• The use of Color normalization did not improve any 
of the hand-crafted descriptors, while for CNN-based 
descriptors they demonstrated to be helpful in dealing 
with complex variations in illumination conditions. 
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TABLE 2 

Classification rates (%) of the texture descriptors considered. For each classification task, the best result is reported in bold. 
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Features 

No variations 
avg (min) 

Light intensity 
avg (min) 

Light direction 
avg (min) 

Daylight temp, 
avg (min) 

LED temperature 
avg (min) 

Daylight vs. LED 
avg (min) 

Temp. OR Dir. 
avg (min) 

Temp. & Dir. 
avg (min) 

Multi-ilium 
avg (min) 

Hist. L 

78.32 (60.66) 

6.77 (1.47) 

20.97 (2.21) 

49.94 (11.95) 

27.18 (5.88) 

38.05 (6.43) 

10.45 (1.29) 

7.98 (1.29) 

47.33 (30.15) 

Hist. H V 

96.38 (84.56) 

31.45 (14.52) 

51.01 (13.97) 

49.11 (9.93) 

51.56 (23.35) 

44.39 (9.19) 

16.47 (4.23) 

11.36 (4.23) 

49.82 (38.42) 

Hist. RGB 

94.93 (87.13) 

15.89 (3.12) 

40.82 (5.70) 

56.45 (18.20) 

37.51 (12.68) 

43.44 (8.00) 

15.53 (2.76) 

10.21 (2.76) 

53.83 (42.28) 

Hist, rgb 

97.24 (92.46) 

67.08 (36.95) 

64.07 (24.63) 

37.35 (6.43) 

17.38 (3.31) 

25.71 (5.15) 

20.16 (2.39) 

12.57 (2.39) 

45.25 (16.36) 

Chrom. mom. 

82.54 (58.46) 

68.43 (48.90) 

53.68 (22.24) 

33.41 (4.96) 

18.66 (3.68) 

24.16 (5.06) 

17.03 (2.21) 

10.63 (2.21) 

33.92 (20.77) 

Coocc. matr. 

35.33 (9.93) 

7.20 (2.02) 

9.39 (0.55) 

23.02 (9.74) 

19.01 (6.62) 

19.88 (5.61) 

3.30 (0.18) 

2.89 (0.18) 

10.48 (6.07) 

Coocc. matr. L 

18.68 (1.47) 

3.32 (0.00) 

5.63 (0.55) 

16.99 (6.99) 

9.49 (3.31) 

12.94 (2.85) 

2.49 (0.00) 

2.57 (0.00) 

6.86 (2.57) 

DT-CWT 

92.26 (81.62) 

21.68 (1.65) 

49.48 (12.32) 

66.29 (25.92) 

42.31 (14.34) 

49.77 (15.44) 

19.23 (3.12) 

13.22 (3.12) 

65.93 (55.33) 

DT-CWT L 

72.85 (58.09) 

10.65 (1.29) 

29.29 (5.15) 

60.13 (27.39) 

32.70 (4.04) 

44.06 (5.06) 

14.70 (1.47) 

12.92 (1.47) 

52.73 (37.13) 

Gabor RGB 

93.02 (61.76) 

66.96 (32.35) 

51.66 (11.95) 

64.81 (20.77) 

38.13 (12.13) 

48.03 (12.59) 

27.18 (3.49) 

16.43 (3.49) 

75.18 (59.93) 

Gabor L 

72.91 (70.04) 

46.57 (18.75) 

29.34 (3.86) 

68.94 (59.56) 

67.62 (58.82) 

66.86 (53.40) 

27.58 (2.57) 

16.09 (2.57) 

74.33 (72.79) 

Opp. Gabor RGB 

i 96.15 (59.38) 

21.51 (3.49) 

50.61 (12.13) 

67.75 (22.98) 

41.78 (14.34) 

50.80 (15.07) 

20.22 (3.86) 

13.47 (3.86) 

58.58 (43.01) 

Gist RGB 

66.20 (62.50) 

55.06 (31.99) 

45.75 (10.85) 

55.49 (28.31) 

36.78 (13.24) 

43.41 (13.79) 

25.13 (2.76) 

17.30 (2.76) 

60.36 (54.78) 

Granulometry 

91.98 (51.65) 

63.73 (27.76) 

48.65 (10.66) 

69.80 (21.51) 

33.58 (6.80) 

48.79 (6.34) 

22.20 (1.65) 

12.86 (1.65) 

74.23 (64.34) 

HoG 

46.74 (43.20) 

37.52 (24.82) 

27.48 (8.46) 

41.14 (29.60) 

35.29 (22.24) 

36.30 (19.30) 

16.99 (3.49) 

11.71 (3.49) 

43.66 (40.62) 

LBP L 

80.37 (77.02) 

51.15 (17.83) 

47.09 (9.19) 

77.76 (72.24) 

70.77 (54.60) 

73.15 (55.06) 

29.54 (5.51) 

18.66 (5.51) 

76.99 (74.26) 

LBP RGB 

93.55 (90.81) 

68.87 (33.46) 

59.48 (13.60) 

72.40 (24.63) 

48.39 (15.07) 

56.08 (16.82) 

23.72 (0.55) 

14.19 (0.55) 

76.81 (67.10) 

LBP Lab 

92.90 (88.42) 

71.88 (32.54) 

61.74 (17.28) 

70.61 (24.08) 

51.53 (21.69) 

56.00 (19.49) 

27.55 (3.31) 

18.06 (3.31) 

77.21 (71.32) 

LBPIiI 2 I 3 

91.40 (82.90) 

66.28 (28.12) 

63.05 (17.10) 

70.58 (25.92) 

49.90 (18.38) 

54.76 (17.00) 

27.05 (1.10) 

18.39 (1.10) 

75.55 (64.89) 

OCLBP 

95.92 (92.28) 

78.75 (51.47) 

65.70 (14.52) 

67.92 (19.67) 

49.94 (15.81) 

53.93 (15.81) 

25.73 (1.65) 

16.99 (1.65) 

76.53 (50.00) 

LCC 

92.92 (88.60) 

62.64 (26.84) 

56.15 (12.13) 

88.78 (73.71) 

74.25 (46.88) 

78.82 (50.64) 

31.13 (5.15) 

19.85 (5.15) 

85.91 (84.38) 

BoVW 

89.73 (86.58) 

87.38 (81.43) 

67.38 (16.18) 

90.02 (88.05) 

88.87 (86.76) 

89.53 (87.68) 

51.59 (12.68) 

39.34 (12.68) 

88.60 (87.68) 

VLAD 

79.29 (75.18) 

76.87 (70.22) 

64.73 (20.77) 

78.70 (76.65) 

77.87 (73.90) 

78.31 (75.74) 

51.44 (18.20) 

42.62 (18.75) 

80.58 (79.78) 

FV 

85.59 (80.51) 

81.02 (71.32) 

69.31 (27.39) 

86.57 (84.38) 

84.58 (78.68) 

85.51 (82.35) 

55.26 (20.04) 

45.95 (20.04) 

85.69 (84.56) 

VggF 

96.94 (89.34) 

88.07 (72.61) 

83.23 (47.06) 

96.27 (86.95) 

87.94 (65.26) 

89.40 (62.96) 

54.89 (13.42) 

45.62 (13.42) 

95.59 (94.49) 

VggM 

97.50 (89.89) 

89.94 (76.29) 

86.14 (52.02) 

97.30 (88.97) 

90.64 (69.85) 

91.91 (71.60) 

60.25 (20.04) 

51.13 (20.04) 

96.23 (94.49) 

VggMS 

97.53 (91.36) 

90.93 (74.82) 

86.24 (51.10) 

96.16 (85.66) 

89.90 (68.75) 

90.33 (69.21) 

59.02 (16.73) 

49.48 (16.73) 

95.53 (93.57) 

Vgg M 2048 

97.27 (89.89) 

89.08 (72.43) 

84.74 (48.35) 

97.38 (90.81) 

90.06 (64.34) 

91.41 (69.58) 

57.18 (14.15) 

47.89 (14.15) 

95.01 (93.38) 

Vgg M 1024 

96.79 (87.87) 

88.53 (69.67) 

84.15 (49.08) 

96.90 (90.07) 

90.14 (63.42) 

91.47 (69.76) 

56.26 (14.15) 

47.26 (14.15) 

93.75 (91.91) 

Vgg M 128 

93.50 (78.31) 

81.54 (59.93) 

78.04 (45.04) 

93.01 (82.35) 

79.72 (46.69) 

83.20 (50.92) 

49.13 (13.79) 

40.75 (13.79) 

89.00 (87.50) 
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83.23 (60.66) 

79.00 (40.81) 

94.51 (80.33) 

85.11 (58.64) 

86.52 (58.00) 

49.50 (9.38) 

40.63 (9.38) 

91.24 (89.15) 

BVLC AlexNet 

94.88 (82.17) 

85.00 (65.07) 

80.25 (43.57) 

95.79 (86.03) 

88.56 (66.36) 

89.64 (65.99) 

48.08 (9.38) 

39.37 (9.38) 

90.44 (88.24) 

Vgg VeryDeep 16 98.21 (92.83) 

94.10 (85.48) 

91.23 (66.73) 

97.41 (86.21) 

93.69 (77.57) 

93.67 (75.83) 

70.81 (33.82) 

63.64 (33.82) 

96.60 (94.85) 

Vgg VeryDeep 19 97.69 (92.10) 
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95.59 (94.49) 
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TABLE 3 

Classification rates (%) of a selection of color descriptors combined with different preprocessing methods. 
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Features 

No variations 
avg (min) 

Light intensity Light direction Daylight temp, 
avg (min) avg (min) avg (min) 

LED temperature Daylight vs. LED 
avg (min) avg (min) 

Temp. OR Dir. 
avg (min) 

Temp. & Dir. 
avg (min) 

Multi-ilium 
avg (min) 

VGG VeryDeep 16 

98.21 (92.83) 

94.10 (85.48) 

91.23 (66.73) 

97.41 (86.21) 

93.69 (77.57) 

93.67 (75.83) 

70.81 (33.82) 

63.64 (33.82) 

96.60 (94.85) 

Retinex McCann 

98.84 (94.67) 

94.04 (86.40) 

92.83 (67.65) 

96.72 (80.88) 

93.43 (78.68) 

93.04 (77.11) 

72.47 (33.46) 

65.70 (33.46) 

96.97 (96.14) 

Retinex Frankie 

98.91 (94.49) 

94.47 (87.32) 

93.49 (70.22) 

97.13 (83.46) 

94.20 (79.41) 

93.96 (79.87) 

75.33 (40.81) 

68.96 (40.81) 

97.40 (96.32) 

Gray-World 

98.24 (87.32) 

98.35 (95.59) 

91.30 (62.13) 

96.26 (83.09) 

78.20 (46.51) 

83.57 (45.96) 

59.85 (24.26) 

50.20 (24.26) 

96.17 (94.49) 

Gray-Edge 

98.64 (89.52) 

90.87 (81.43) 

90.93 (66.18) 

97.56 (92.28) 

93.21 (81.43) 

94.09 (81.43) 

73.82 (41.18) 

66.76 (41.18) 

96.78 (95.77) 

Weighted Gray-Edge 

98.37 (85.11) 

91.18 (78.31) 

90.69 (65.26) 

97.62 (93.01) 

93.24 (81.99) 

94.18 (81.16) 

73.57 (41.36) 

66.45 (41.36) 

96.51 (94.49) 

LBP RGB 

93.55 (90.81) 

68.87 (33.46) 

59.48 (13.60) 

72.40 (24.63) 

48.39 (15.07) 

56.08 (16.82) 

23.72 (0.55) 

14.19 (0.55) 

76.81 (67.10) 

Retinex McCann 

94.07 (91.18) 

69.61 (38.24) 

60.77 (13.60) 

76.21 (31.80) 

56.59 (26.47) 

61.71 (23.71) 

28.42 (2.94) 

18.16 (2.94) 

82.69 (76.47) 

Retinex Frankie 

94.21 (90.62) 

68.37 (33.64) 

58.37 (14.71) 

71.99 (24.45) 

47.73 (16.54) 

55.40 (16.54) 

24.71 (2.21) 

14.92 (2.21) 

75.46 (64.71) 

Gray-World 

93.63 (90.81) 

80.91 (62.68) 

61.94 (13.42) 

77.88 (37.68) 

47.09 (12.68) 

58.19 (13.51) 

27.10 (0.74) 

17.11 (0.74) 

72.40 (63.05) 

Gray-Edge 

94.03 (91.18) 

62.96 (27.02) 

58.66 (14.15) 

72.79 (30.33) 

44.01 (10.66) 

54.02 (13.60) 

22.75 (0.37) 

13.30 (0.37) 

75.80 (65.99) 

Weighted Gray-Edge 

93.93 (81.62) 

63.45 (26.65) 

58.87 (13.97) 

72.83 (27.76) 

44.09 (10.85) 

53.98 (14.25) 

22.95 (0.55) 

13.40 (0.55) 

76.29 (67.28) 

Hist rgb 

97.24 (92.46) 

67.08 (36.95) 

64.07 (24.63) 

37.35 (6.43) 

17.38 (3.31) 

25.71 (5.15) 

20.16 (2.39) 

12.57 (2.39) 

45.25 (16.36) 

Retinex McCann 

98.66 (95.77) 

65.40 (38.60) 

57.01 (18.93) 

32.79 (7.17) 

17.54 (2.76) 

23.76 (5.70) 

16.22 (2.21) 

10.06 (2.39) 

38.91 (28.49) 

Retinex Frankie 

98.82 (95.77) 

66.42 (39.52) 

57.47 (19.67) 

34.81 (6.99) 

17.95 (3.12) 

24.45 (5.24) 

16.73 (2.76) 

10.15 (2.76) 

41.42 (28.49) 

Gray-World 

98.81 (96.32) 

47.87 (19.30) 

36.08 (5.70) 

51.90 (10.48) 

22.42 (1.10) 

35.12 (0.46) 

13.41 (0.00) 

8.36 (0.00) 

27.24 (10.66) 

Gray-Edge 

98.36 (96.32) 

78.80 (59.38) 

70.90 (31.80) 

64.46 (17.10) 

37.95 (8.46) 

46.55 (9.47) 

33.42 (6.25) 

24.51 (6.25) 

58.92 (35.29) 

Weighted Gray-Edge 

98.16 (84.01) 

75.97 (55.15) 

69.37 (29.60) 

64.62 (18.38) 

38.65 (9.74) 

46.90 (8.46) 

33.08 (6.99) 

24.59 (6.99) 

52.30 (30.88) 

Gabor RGB 

93.02 (61.76) 

66.96 (32.35) 

51.66 (11.95) 

64.81 (20.77) 

38.13 (12.13) 

48.03 (12.59) 

27.18 (3.49) 

16.43 (3.49) 

75.18 (59.93) 

Retinex McCann 

92.91 (89.34) 

66.88 (32.90) 

51.08 (12.68) 

65.87 (22.24) 

37.55 (10.66) 

48.24 (11.40) 

22.96 (0.92) 

13.12 (0.92) 

75.80 (64.71) 

Retinex Frankie 

93.57 (90.07) 

67.31 (33.27) 

51.51 (12.68) 

66.02 (21.88) 

37.71 (11.03) 

48.50 (11.95) 

23.24 (0.92) 

13.15 (0.92) 

75.77 (64.15) 

Gray-World 

94.06 (92.28) 

70.54 (37.32) 

52.43 (13.42) 

68.40 (23.35) 

40.29 (11.76) 

50.77 (12.68) 

26.27 (2.02) 

15.18 (2.02) 

78.40 (67.46) 
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93.46 (60.85) 

64.57 (31.43) 

51.51 (11.95) 

66.05 (22.79) 
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